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ABSTRACT 

We report a discovery of a quasar at z = 4.96 db 0.03 within a few Mpc of the quasar SDSS 
0338+0021 at z = 5.02 ± 0.02. The newly found quasar has the SDSS i and z magnitudes of 
« 21.2, and an estimated absolute magnitude Mb ~ —25.2. The projected separation on the 
sky is 196 arcsec, and the redshift difference Az = 0.063 ± 0.008. The probability of finding this 
quasar pair by chance in the absence of clustering in this particular volume is ~ 10~ 4 - 10~ 3 . 
We conclude that the two objects probably mark a large-scale structure, possibly a protocluster, 
at z ~ 5. This is the most distant such structure currently known. Our search in the field of 
13 other QSOs at z > 4.8 so far has not resulted in any detections of comparable luminous 
QSO pairs, and it is thus not yet clear how representative is this structure at z ~ 5. However, 
along with the other evidence for clustering of quasars and young galaxies at somewhat lower 
redshifts, the observations are at least qualitatively consistent with a strong biasing of the first 
luminous and massive objects, in agreement with general predictions of theoretical models. More 
extensive searches for clustered quasars and luminous galaxies at these redshifts will provide 
valuable empirical constraints for our understanding of early galaxy and structure formation. 

Subject headings: cosmology: observations - galaxies: formation - quasars: general - quasars: 
individual (SDSS 0338+0021, RD 657) 



1. INTRODUCTION 

Quasars at large redshifts represent a powerful 
probe of structure formation in the early universe. 
Dissipative mergers and tidal interactions during the 
early stages of galaxy assembly may be fueling both 
bursts of star formation and early AGN activity (see, 
e.g., Silk k. Rees 1998, Franceschini et al. 1999, 
Monaco et al. 1999, Kauffmann & Haehnelt 2000, 
Granato et al. 2001, etc.). This fundamental con- 
nection is supported by the remarkable correlations 
between the masses of central black holes in nearby 
galaxies, and the velocity dispersions and luminosi- 
ties (~ masses) of their old, metal-rich stellar pop- 
ulations (Magorrian et al. 1998, Ferrarese & Merritt 
2000, Gebhardt et al. 2000), and by the high metallic- 



ities observed in the high-z quasar spectra (Hamann 
&: Ferland 1999, and references therein). Quasars 
can also be used directly to probe evolution of large- 
scale structure out to high redshifts, as demonstrated 
clearly by the 2QZ survey (Croom et al. 2001, Hoyle 
et al. 2002). 

There should also be a fundamental connection be- 
tween the formation of galaxies and the large-scale 
density field in the early universe. The highest den- 
sity peaks, where presumably the first luminous ob- 
jects formed, should be strongly clustered, due to bi- 
asing (Kaiser 1984). This is a generic and robust 
prediction for essentially every model of structure for- 
mation, independent of any astrophysical details of 
galaxy formation. 

Luminous high-z quasars are likely situated in mas- 
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sive hosts (Turner 1991). Such massive halos should 
be rare, and may be associated with ~ 4 to 5-a peaks 
of the primordial density field (Efstathiou & Rees 
1988, Cole & Kaiser 1989, Nusser & Silk 1993). High- 
z quasars can thus be used as biased tracers of the 
early large-scale structure, possibly marking the cores 
of future rich clusters. 

A search for protoclusters around known high-z ob- 
jects such as quasars thus provides an important test 
of our basic ideas about the biased galaxy forma- 
tion. We have conducted a search for clustered pro- 
togalaxies and AGN in the fields of selected quasars 
at z > 4. Preliminary results from our program have 
been described by Djorgovski 1999 and Djorgovski et 
al. 1999, and a complete account will be presented 
elsewhere. 

Here we present the discovery of a clustered quasar 
pair at z pa 5, which we interpret as a signature of a 
primordial large-scale structure, possibly a core of a 
forming rich cluster. This is the most distant large- 
scale structure currently known. One of the quasars, 
SDSS 0338+0021, was discovered by Fan et al. 1999; 
the other was found in our much deeper search in its 
vicinity. 

2. OBSERVATIONS AND DATA REDUCTIONS 

Deep images of the field were obtained at the Palo- 
mar 200-inch Hale telescope using the prime-focus 
Cosmic imager (Kells et al. 1998). The instrument 
FOV is 9.7 arcmin square, with 0.286 arcsec pixels. 
Multiple dithered exposures totalling 1800 s, 3000 s, 
and 7200 s were obtained in Gunn r, i, and z filters 
respectively on 29 and 30 November 2000 UT, and 
of 3600 s, 1800 s, and 2400 s in Gunn g, r, and z 
filters respectively on 31 December 2000 UT, all in 
good conditions. The data were reduced using stan- 
dard procedures. Limiting magnitudes (3-cr in a 3 
arcsec apertures) are ru m pa 25.9, iu m pa 25.5, and 
zii m pa 23.1 mag in the SDSS (~ AB V ) photomet- 
ric system. Multi-color selection was used to identify 
candidates for objects at z > 5, as illustrated in Fig. 
1. Details of the data reduction and candidate selec- 
tion procedure will be presented elsewhere. 

The first set of imaging data was used to select 
candidates for multi-slit spectroscopy, and spectra 
of several objects, including the known QSO (SDSS 
0338+0021), were obtained at the WMKO Keck-I 10- 
m telescope on 29 December 2000 UT in good condi- 
tions. We used the LRIS instrument (Oke et al. 1995) 
with a 400 lines mm" 1 grating (Ablaze = 8500 A) and 
a GG495 order sorting filter, through 1.2 arcsec wide 
slitlets, with a mean dispersion in the wavelength re- 
gion of interest of pa 1.86 A pixel -1 . Two exposures 
of 1800 s each were obtained through a single slit- 
mask at a PA = 340°, close to the parallactic angle 



at the time. Exposures of arc lamps were used for 
wavelength calibration, with the flexure shifts cor- 
rected using the measurements of selected night sky 
lines. An average of archival response curves for this 
grating was used for the flux calibration. 

Only one high-priority color-selected candidate, 
which we designated RD 657 (for "red dropout" and 
a serial number in our CCD object catalog), could be 
accommodated on this slitmask. It turned out to be 
a quasar at z pa 5. The position of this object (J2000) 
is: 

a = 03 h 38 m 30.03 s 5 = +00° 18' 40.4", 

giving the projected separation from SDSS 
0338+0021 of 196 arcsec. Setting the photometric 
zero-points on the magnitudes of SDSS 0338+0021 
published by Fan et al. 1999, i.e., r = 21.70, 
i = 19.96, and z = 19.74, the magnitudes of the 
new QSO in the same SDSS system are r = 23.01, 
i = 21.24, and z = 21.16 mag, with estimated un- 
certainties of ~ 0.1 mag, making it about 3.5 times 
fainter than the SDSS QSO. Fig. 2 shows the finding 
chart for the field, with both quasars marked. 

Additional long-slit spectra of both objects were 
obtained using LRIS on 31 December 2000 UT in 
good conditions. We used a 600 lines mm" 1 grating 
(Ablaze = 7500A) through a 1.0 arcsec wide slit, cov- 
ering the wavelength range A ~ 4900 — 7450 A, with a 
mean dispersion in the wavelength region of interest 
of pa 1.255 A pixel -1 . Two exposures of 1200 s were 
obtained for SDSS 0338+0021, and four exposures of 
1200 s for RD 657, with the object dithered on the 
slit between the exposures. The slit PA was always 
close to the parallactic angle. Arc lamps were used 
for wavelength calibration, with the flexure shifts cor- 
rected using the measurements of selected night sky 
lines, and exposures of standard star Hiltner 600 were 
used for the flux calibration. 

We rebinned both grating data sets to a common 
sampling grid of 2 A pixel -1 , smoothed with Gaus- 
sians with a = 2 A (lower than the optimal smooth- 
ing filter for these data, thus resulting in no loss of 
information). Fig. 3 shows the combined spectra of 
the two objects. The redshifts determined from the 
emission lines (taking into account the absorption of 
the blue side of Lya) are z e = 5.02 ± 0.02 for SDSS 
0338+0021, and z e = 4.96 ± 0.03 for RD 657, with 
the large uncertainties due to the intrinsic difficulty 
of centering of broad emission lines. We also note the 
presence of a Lyman limit system at zlls = 4.99 in 
the spectrum of the brighter QSO. 

While absolute values of the redshifts cannot be 
measured very precisely, we used a simple cross- 
correlation to evaluate the redshift difference be- 
tween the two objects. We excluded the portions 
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of the spectra blueward of the centroids of the Lya 
lines, since the differences in the IGM absorption be- 
tween the two lines of sight could significantly affect 
the results. We obtain Az = 0.063 ± 0.008, which 
corresponds to the restframe velocity difference of 
At; = 3150 ±400 km s" 1 (these are the formal errors; 
possible systematic errors due to the mismatch of the 
spectra are hard to estimate precisely, but could be 
of the same order) . Looking for correlated absorption 
systems in the two spectra would require data with a 
higher S/N and a higher resolution. 

We obtained preliminary spectra of a number of 
other, fainter, color-selected candidates in the field. 
While the results are still inconclusive, none of them 
are luminous AGN. 

3. DISCUSSION AND CONCLUSIONS 

In what follows, for the sake of consistency with 
the previous work we will use the "standard quasar 
cosmology" with Hq = 50 km s _1 Mpc^ 1 , = 1, 
and Ao = 0. 

In this cosmology, the observed angular separa- 
tion of 196 arcsec corresponds to a projected spatial 
separation of 1.126 proper Mpc, or 6.744 comoving 
Mpc at (z) = 4.99. If we assume that the observed 
Az = 0.063 ± 0.008 is due entirely to cosmological 
expansion, the radial separation is 4.27 ±0.56 proper 
Mpc (25.6 ± 3.6 comoving). Quadratic sum of the 
two suggests a spatial separation of the two quasars 
of 4.4 ± 0.6 proper Mpc (26.5 ± 3.6 comoving). How- 
ever, this interpretation of the observed redshift dif- 
ference as being due only to the Hubble expansion is 
uncertain. If we did not have any radial separation 
information, we could estimate the spatial separation 
from the projected separation alone: statistically, for 
a pair randomly oriented in space, the ratio of the 
two is 7r/2, so that the most probable spatial separa- 
tion corresponding to our projected separation is 1.77 
proper Mpc (10.6 comoving). In the discussion below 
(which is similar to that by Schneider et al. 2000) we 
will consider both of these possibilities, i.e., comoving 
separations of 26.5 Mpc (the cosmological expansion 
model) and 10.6 Mpc (the deprojection model). 

Individual quasars represent rare events in the gen- 
eral population of galaxies at any redshift; how likely 
is to have two of them so close together? In order 
to estimate the probability of finding such a close 
quasar pair by chance at this redshift, we use the 
evolving QSO luminosity function (QLF) by Fan et 
al. 2001. For SDSS 0338±0021, we adopt the ab- 
solute blue magnitude Mb = —26.56 from Fan et 
al. 1999. Using the mean i and z magnitude differ- 
ence of ~ 1.35 mag, we estimate Mb = —25.2 for 
RD 657. The Fan et al. 2001 QLF gives a num- 
ber density of 4.35 x 10~ 8 Mpc~ 3 for quasars with 



Mb < —25.2 at this redshift. The comoving volumes 
enclosed by spheres with radii equivalent to the phys- 
ical separations of the quasars of 10.6 and 25.6 Mpc 
are 4.97 x 10 3 Mpc 3 and 7.01 x 10 4 Mpc 3 , respectively. 

The first question we can ask is, what is the a priori 
probability of finding such a close pair of QSOs at this 
redshift, regardless of the particular survey strategy? 
Assuming a Poissonian distribution of quasars, the 
probabilities of finding two QSOs at these luminosi- 
ties in these volumes are 2.3 x 10~ 8 and 4.6 x 10~ 6 , 
respectively. A similar reasoning was applied to the 
two serendipitously discovered quasar pairs at z > 4 
(Schneider et al. 1994a, Schneider et al. 2000). 

However, the volume in which we found this pair 
was not selected at random: it is centered on a pre- 
viously known QSO. We can thus ask an alternative 
question, which is specific for our experiment, namely, 
what is the probability that another QSO is found at 
random in this particular volume? (We note that the 
same answer would apply whether or not there is an 
already known QSO in its center.) The probabilities 
then become 2.2 x 10~ 4 and 3.0 x 10~ 3 , respectively. 
Thus, it is still unlikely that this pair represents a 
chance occurence, suggesting that there is some phys- 
ical clustering present. 

We note that as of early 2003, we observed a to- 
tal of 14 fields of QSOs at z > 4.8 covering the FOV 
~ 25 arcmin diameter (~ 52 comoving Mpc in the 
cosmology used here) to a comparable depth. Our 
spectroscopic follow-up is still incomplete, and thus it 
would be premature to include this additional volume 
in the present computation, but to date no other cases 
of comparably bright QSO pairs have been found, 
suggesting that this system must be a relatively rare 
event. A full analysis and estimates of the QSO clus- 
tering and bias will be presented in a future paper, 
once the survey is complete. 

We also note that in a deeper Keck survey of ~ 20 
QSOs at z ~ 4 — 4.7, but covering a smaller FOV, 
^6x8 arcmin 2 (~ 12 x 16 comoving Mpc 2 ) we found 
at least 2 AGN companions to the known, bright 
QSOs, with sub-Mpc separations (not gravitational 
lenses), as well as a number of clustered faint galaxy 
companions (Djorgovski 1999, Djorgovski et al. 1999, 
and in prep.). 

The clustering strength cannot be meaningfully 
measured from a single pair of objects in a survey of 
as yet poorly defined coverage. With this caveat in 
mind, the small probability of a random occurrence of 
such a pair implies an effective ro which could be con- 
siderably greater than the observed pair separation, 
i.e., ro >> 10 comoving Mpc. At low redshifts, there 
is some spread or results between different groups 
(see, e.g., Boyle k Mo 1993, Croom k Shanks 1996, or 
Sabbey et al. 2001), but most authors find that the 
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observed clustering length of quasars is comparable 
to that of galaxy groups, tq ~ 10 — 20 Mpc (Bahcall 
& Chokshi 1991; see Hartwick &; Schade 1990 for a 
review and references). A standard parameterization 
of the evolution of clustering in comoving coordinates 
is given by the formula 

ar,z) = (£) 7 (l + z)-( 3 -^) 

where 7 ~ 1.8 and e is the evolution parameter. 
For our chosen cosmology, the expected value for the 
CDM cosmogony is e « 0.8, and this is consistent 
with observations of the evolution of galaxy clustering 
at z < 1 (see, e.g., Carlberg ef al. 2000). Thus, one 
expects a strong decrease in the clustering strength 
at higher redshifts, and in any model gravitational 
clustering is always expected to grow in time. How do 
we then explain the apparent increase in the strength 
of quasar clustering at high redshifts? 

The most natural explanation is that quasars repre- 
sent highly biased peaks of the density field, and that 
the bias itself evolves in time. Ever since the first de- 
tections of QSO clustering (e.g., Shaver 1984, Shanks 
et al. 1987, Iovino & Shaver 1988, Mo & Fang 1993, 
etc.) it was considered possible that QSOs represent 
biased tracers of the density field, but the evolution of 
such bias was not clear. La Franca et al. 1998 found 
a turn-up in the clustering strength of quasars even 
at redshifts as low as z ~ 2, but this was not con- 
firmed in a much larger sample by Croom et al. 2001. 
Outram et al. 2003 find no evidence for an increase 
in the QSO clustering power spectrum amplitude out 
toz~ 2.2. 

The first hints of such an effect at high redshifts 
were provided by the three few-Mpc quasar pairs at 
z > 3, found in the statistically complete survey 
by Schneider et al. 1994b, as first pointed out by 
Djorgovski et al. 1993, and subsequently confirmed 
by more detailed analysis (Kundic 1997, Stephens et 
al. 1997). Intriguingly, the frequency of the few-Mpc 
separation quasar pairs at lower redshifts is roughly 
what may be expected from normal galaxy cluster- 
ing (Djorgovski 1991; see also Zhdanov & Surdej 
2001). There is even a hint of a possible superclus- 
tering of quasars at z > 4, on scales ~ 100 hr 1 co- 
moving Mpc (Djorgovski 1998), comparable to the 
scales of the first Doppler peak in CMBR fluctua- 
tions. Observations of large numbers of field galaxies 
at z ~ 3 — 3.5 also show a relatively strong cluster- 
ing, with ro ~ 5 — 10 Mpc, comparable to the galaxy 
clustering at z ~ (Steidel et al. 1998, Adelberger et 



al. 1998); this is also almost certainly a manifestation 
of biasing. 

However, the bias should be even stronger at higher 
redshifts, and what is observed at z ~ 3 should be 
even more pronounced at z ~ 5: the earliest massive 
galaxies, including quasar hosts, should be strongly 
clustered. An example may be the possible grouping 
of Lya emitters at z ~ 4.86 in the Subary Deep Field 
(Ouchi et al. 2003, Shimasaku et al. 2003). Strong 
increase in biasing at high redshifts is also indicated 
in numerous theoretical studies, e.g., by Brainerd & 
Villumsen 1994, Matarrese et al. 1997, Moscardini 
et al. 1998, Blanton et al. 2000, Magliocchetti et 
al. 2000, Valageas et al. 2001, Basilakos & Plionis 
2001, etc. What these studies show is that a simple 
(ro,e) parameterization of the clustering evolution is 
inadequate, and that the evolution of the bias factor, 
b, plays a key role. The effective bias factor gener- 
ally increases with the redshift and the object mass 
(e.g., especially for the more luminous Lyman-break 
galaxies or the quasar hosts). For example, Croom 
et al. 2002 find a marginally stronger clustering for 
the brighter QSOs, which might be residing in more 
massive hosts, and thus be more biased. 

The chief uncertainty in our current understand- 
ing and interpretation of the structure evolution at 
high redshift, as indicated by luminous objects we 
can observe, is the evolution of bias. Observations of 
the clustering of quasars and galaxies around them at 
z ~ 4 — 5 and beyond can provide valuable empirical 
constraints in this endeavor. A better understanding 
of the primordial clustering of luminous sources is also 
important for models of the cosmic reionization (see 
Djorgovski et al. 2001 and references therein), and 
thus for the interpretation of CMBR fluctuations at 
high angular frequencies. The quasar pair described 
here may be indicative of the biased clustering at 
z ~ 5, and more extensive and deeper surveys will 
provide additional observational input for the models 
of galaxy and large-scale structure formation. 
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Fig. 1. — A color-color diagram for all objects detected in the P200 imaging field, with the two quasars labeled. 
The dashed line shows our color selection boundary. The two quasars are labeled. 
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Fig. 2. — A finding chart for the held, from the P200 i-band images. The field size is 150 by 300 arcsec, with 
North up and East to the left. The two quasars are labeled. 




Fig. 3. — Combined Keck/LRIS spectra of the two quasars, with the main features labeled. 



